Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
3258 | 49964 | 1,5 |
5030 | 32648 | 2,5 |
5868 | 27862 | 0,5 |
8443 | 19120 | 3,5 |
11967 | 13158 | 1,2 |
13206 | 11848 | 4,5 |
16112 | 9406 | 1,6 |
16716 | 9013 | 1,3 |
17111 | 8774 | 1,8 |
17434 | 8584 | 7,5 |

Words containing (:

Rank in Wordlist | Frequency | Word |
---|---|---|
3822788 | 2 | 1-2/5''(3.5cm |
4130188 | 2 | Ilgis-68''(173cm |
5403585 | 1 | .( |
5432210 | 1 | 1)80%,(2)80%,(3)80% |
5442403 | 1 | 1.5''(0220-704315 |
5545890 | 1 | 18,7%(-2% |
5592382 | 1 | 2-1-name''(DRL |
5677851 | 1 | 3''(7.5cm |
5749860 | 1 | 43''(110 |
5776848 | 1 | 500-75(15%)-45(9% |

Words containing ):

Rank in Wordlist | Frequency | Word |
---|---|---|
201437 | 331 | .) |
3090289 | 3 | .-) |
5432210 | 1 | 1)80%,(2)80%,(3)80% |
5736283 | 1 | 4-6%).70% |
5776848 | 1 | 500-75(15%)-45(9% |
6462485 | 1 | FILM)''A |
7701209 | 1 | STAG-150''),valdančiu |
8532891 | 1 | baužiukais''),ir |
8672901 | 1 | cementovkė''),arba |
8723548 | 1 | daiva'':)Jei |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
4903 | 33395 | 100% |
10637 | 14978 | 50% |
10904 | 14564 | 10% |
12962 | 12080 | 20% |
13070 | 11979 | 5% |
13419 | 11625 | 30% |
17430 | 8585 | 2% |
17682 | 8453 | 80% |
19118 | 7734 | 90% |
20258 | 7249 | 40% |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
58893 | 1947 | G&G |
65982 | 1670 | S&P |
69053 | 1570 | H&M |
110962 | 804 | B&B |
123770 | 685 | R&B |
141889 | 560 | R&D |
168222 | 436 | P&C |
176577 | 405 | AT&T |
190355 | 361 | D&G |
218904 | 292 | Cooper&Hunter |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
690203 | 45 | A$AP |
970087 | 25 | Nor$dami |
1632937 | 10 | P$G |
1724840 | 9 | Ke$ha |
2050955 | 7 | i$jungti |
2098653 | 7 | si$lo |
2125925 | 6 | 10$/mėn |
2125989 | 6 | 10/$20 |
2268738 | 6 | mar$ruto |
2303049 | 6 | pri$mimo |

Words containing ":

Rank in Wordlist | Frequency | Word |
---|---|---|
2024 | 76854 | ." |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
50291 | 2400 | .' |
66297 | 1659 | King's |
87543 | 1133 | Victoria's |
93370 | 1031 | Bishop's |
101985 | 910 | George'as |
120062 | 716 | Facebook'e |
132011 | 623 | George'o |
133131 | 615 | Mike'as |
139398 | 575 | McDonald's |
171062 | 425 | Fontaine-l'Évêque |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
135751 | 598 | 2+1 |
145597 | 539 | 2+2 |
157467 | 480 | M+S |
176571 | 405 | 1+1 |
199898 | 335 | HERTH+BUSS |
249981 | 238 | Quashqai+2 |
276755 | 203 | n+k |
291010 | 187 | 5+1 |
299279 | 179 | 3+1 |
308611 | 171 | IBAD+0plBAD+Bstos |

Words containing *:

Rank in Wordlist | Frequency | Word |
---|---|---|
5415153 | 1 | 000*50%*28.5% |
5598260 | 1 | 20%*12% |
5770002 | 1 | 5%*88% |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
5026 | 32662 | https://www |
9106 | 17591 | ir/ar |
9507 | 16788 | km/h |
12605 | 12443 | cthttp://www |
17653 | 8470 | mg/kg |
18036 | 8263 | ir/arba |
19057 | 7759 | 1/2 |
21408 | 6801 | km/val |
21421 | 6796 | http://www |
21963 | 6609 | 2/3 |
In the last subsection of this type, we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words, while others are not. If words containing forbidden characters do not have a very low frequency, there may be a problem in preprocessing.
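Example query, here for words containing % (the literal % is escaped with a backslash because % is also the LIKE wildcard; w_id-100 is the rank in the word list):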
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
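
The same check can be repeated for every character in the list. The following is a minimal sketch in Python, not the tool used to produce this report. It assumes a `words` table with the columns `w_id`, `word` and `freq` as in the query above, and uses an in-memory SQLite database as a stand-in for the real one. Unlike the query above, SQLite has no default escape character for LIKE, so the sketch adds an explicit `ESCAPE` clause and an `ORDER BY`. The helper names, the frequency threshold of 100 and the sample word `namas` are hypothetical; the other sample rows are copied from the tables in this section.

```python
import sqlite3

# Characters checked in this subsection.
SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(char):
    """Build a LIKE pattern that matches any word containing `char`.

    % and _ are LIKE wildcards and the backslash is used as the escape
    character below, so these three are escaped to match literally.
    """
    if char in ("%", "_", "\\"):
        char = "\\" + char
    return "%" + char + "%"


def words_containing(conn, char, limit=10):
    """Return (rank, freq, word) rows for words containing `char`."""
    query = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(query, (like_pattern(char), limit)).fetchall()


def suspicious(conn, chars, min_freq):
    """Map each character to the returned words with freq >= min_freq.

    min_freq is a hypothetical threshold; what counts as "very low
    frequency" depends on the corpus size and the language.
    """
    hits = {}
    for ch in chars:
        rows = [row for row in words_containing(conn, ch) if row[1] >= min_freq]
        if rows:
            hits[ch] = rows
    return hits


# Small stand-in database (w_id = rank + 100, as implied by the query above).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [
        (3358, "1,5", 49964),
        (5003, "100%", 33395),
        (58993, "G&G", 1947),
        (5403685, ".(", 1),
        (200, "namas", 1000),  # hypothetical plain word, not from the tables
    ],
)

for ch, rows in suspicious(conn, SPECIAL_CHARS, min_freq=100).items():
    print(ch, rows)
```

Passing the pattern as a bound parameter avoids having to quote `"` and `'` by hand. Whether a flagged character indicates a preprocessing problem still has to be judged per language, as noted above.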